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Abstract 

We argue that discourse plans must capture the in- 
tended causal and decompositional relations between 
communicative actions. We present a planning algo- 
rithm, DPOCL, that builds plan structures that prop- 
erly capture these relations, and show how these struc- 
tures are used to solve the problems that plagued previ- 
ous discourse planners, and allow a system to participate 
effectively and flexibly in an ongoing dialogue. 

Introduction 

The close connection between discourse and intention is 
by now nearly universally accepted: generating discourse 
is an intentional activity, the structure of discourse re- 
flects the structure of the participants' intentions, and 
understanding discourse involves, at least in part, rec- 
ognizing the intentions of the language producer. Re- 
searchers working both on generation and interpretation 
are wont to exhibit "discourse plans" that represent the 
intentions of language users. However, there has been 
much confusion about exactly what constitutes a dis- 
course plan, and what kind of algorithms should process 
them. Most of the work in computational linguistics has 
built on plan representations and planning algorithms 
that are at least a decade old — representations and al- 
gorithms that suffer from being unprincipled and diffi- 
cult to analyze. These difficulties have spilled over into 
the NL systems that rely on them. Yet within the past 
few years, the literature on AI planning has grown sig- 
nificantly, and the older representations and algorithms 
have been reanalyzed and replaced with cleaner repre- 
sentations and algorithms whose formal properties are 
amenable to careful analysis. 

In this paper, we illustrate some of the problems 
that arise from using these old plan representations 
and planning algorithms. We then show how more 
recent planning algorit hms, called partial-order causa l 
link (POCL) planners (|McAUister and Rosenbhtt, 1991 



Penberthy and Weld, 1991 ), can be used to generate 
discourse plans. The particular planning algorithm we 
use is DPOCL, an algorithm that intro duces action de- 
composition into a POCL framework ( Young, Pollack 
and Moore, 1994). We show that the discourse plans 



both the intended causal and decompositional relations 
among the communicative actions, and thereby solve 
the problems of earlier systems and allow a language- 
processing system to participate effectively and flexibly 
in an ongoing dialogue. 

Previous Approaches 

Discourse is typically viewed as having a hierarchical 
structure and therefore many discourse planners are 
based on the original N OAH (Sacerdoti, 1977) mode l 
of hierarchi c al planning ( Appelt, 1985 ; [Cawsey, 1993| ; 



produced by the DPOCL algorithm properly capture 



Hovy, 1991| ; pVIaybury, 1992| ; [Moore and Paris, 1993] ). 
These systems rely on customized planning algorithms 
with procedural semantics for the purposes of solving 
speciflc text-planning problems. The informal construc- 
tion of these systems and their application to particular 
problems have resulted in successful text generation for 
limited domains and text types, while obscuring the un- 
desirable properties of the algorithms. However, careful 
analysis of these programs shows that there is nothing 
in their semantics to prevent them from generating in- 
correct plans, generating plans with redundant steps, or 
failing to find plans in situations where they exist. To 
the extent that these planners have been able to avoid 
these problems, they have done so by severely limiting 
the expressive power of action descriptions and/or re- 
quiring the designer of action descriptions to handcraft 
each description to fit correctly into the ad hoc semantics 
of the specific plan for which the action is intended. 

Within the planning literature, it has been noted that 
there are two different ways in which component ac- 
tions of a plan may be related: an action ACTl may 
provide causal support for another action ACT2 (i.e., 
ACTl establishes a precondition of ACT2) or an ac- 
tion ACTl may be part of the decomposition of ACT2. 
Similar distinctions have been n oted in the NL liter- 
ature, e.g., Grosz and Sidner's (198£) distinction be- 
tween satisfaction-precedence and dominance and Pol- 
lack's (1990) distinction between enablement and gener- 
ation. 

The main problem with most previous discourse plan- 
ning systems is that they have not adequately repre- 
sented both the causal and decompositional relations 
between actions in a discourse plan. That is, they do 
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Figure 1: Schematic Discourse Plan Illustrating a Re- 
dundant Step 
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Figure 2: Schematic Discourse Plan Illustrating Par- 
ent/Subplan Effects 



not reason about interactions between the effects of ac- 
tions in the plan. More specifically, they do not reason 
about the establishment of preconditions, or the possi- 
bility that one step in the plan may accidentally undo 
or obviate the effect of another step. Moreover, in cases 
where they perform decomposition, they do not reason 
about the relationship between the effects of actions in 
a subplan and the effects of their parent action. 

To illustrate two of these problems, consider the dis- 
course plans shown schematically in Figures [I] and |. 
In these figures, conditions, denoted by the Ci, appear- 
ing to the left of an action denote its preconditions 
and those appearing to the right of an action denote 
its effects. These plans have structure that is typi- 
cal of those produced by most previous discourse plan- 



ning systems ( 


Cawsey, 1993 




Hovy, 1991; 


Maybury, 1992; 


Moore and Paris, 1993). Fi^ 


jure 111 shows a plan where the 



effect cl is established by two different actions occurring 
in different subtrees of the plan. This can occur because 
these planners do not consider the roles that previous ac- 
tions' effects can play in satisfying the preconditions of 
subsequent discourse actions.|^ Thus, they cannot detect 
when an action added to establish one particular condi- 
tion may serendipitously satisfy conditions of other steps 
in the plan. This failure may lead to the generation of 
texts that are (unintentionally) redundant or repetitive. 
An analogous, and possibly even more damaging, prob- 
lem may result when these systems fail to notice that 



^Appelt (1985) would solve this problem with critics, 
i.e., ad hoc procedures that check for certain types of plan 
interactions. 



one action undoes the effect of another. 

Figure ^ shows a plan where there is no explicit con- 
nection between the effects established by the parent ac- 
tion (c4 and c5) and those established by its subplan (c6 
through cll). Previous approaches only represent the 
relationship between actions at different levels; they fail 
to capture the relationship between the effects of those 
actions. In Figure the top-level goal is c4Ac5. Suppose 
that c6 unifies with c4, and that c8, c9, and clO together 
have a consequence that unifies with c5. In this case, c7 
and cll are side effects of choosing the decomposition 
of the PARENT- ACTION into ACTIONl, ACTI0N2 
and ACTI0N3. This fact, however, is not captured in 
the discourse plan of Figure |^. Hence a system relying 
on this plan could not distinguish intended effects from 
side effects, and so would be unable to determine that 
the failure of c6 warrants a different response than the 
failure of c7. 

In short, these systems cannot, in general, determine 
how discourse actions are related to one another. Yet, as 
wc will illustrate in the next section, understanding the 
intended relations between discourse actions is crucial to 
effective language generation. 

The Significance of Discourse Plans 

Consider the following sample discourse, a fragment of 
a political discussion between two participants, Sharon 
(S) and Harry (H). 

S: Wiggins will vote no on NAFTA. 
She's an ally of the unions. Her 
district is heavily industrial. 

A plausible and typical analysis of this discourse is 
that Sharon's primary intention is to convince Harry that 
Wiggins will vote no on NAFTA. To achieve this goal, 
Sharon asserts the proposition in question (that Wiggins 
will vote no on NAFTA) and then supports it by claiming 
that Wiggins is an ally of the unions. To convince Harry 
of this later claim, Sharon supports it by claiming that 
Wiggins's district is heavily industrial. 

Now consider these possible alternative responses by 
Harry to Sharon's statement: 

HI: I didn't think her district was in- 
dustrial. 

H2: Lots of representatives from in- 
dustrial districts vote against the 
union. 

H3: Well, she's certainly pro-union, 
but I didn't think her district was 
industrial. 

H4: Well, she's certainly pro- 
union, but lots of representatives 
from industrial districts support 
NAFTA. 



H5: I didn't think her district was in- 
dustrial. And besides, lots of rep- 
resentatives from industrial dis- 
tricts support NAFTA. 

How is Sharon to determine an appropriate response 
to these replies? As we have pointed out (Moore and 



Paris, 1993; Moore and Pollack, 1992), Sharon's response 
must take account of what Harry's reply reveals about 
which parts of Sharon's discourse plan were successful. 

For example, in HI Harry's failure to believe that Wig- 
gins 's district is industrial blocks the support that this 
claim would have provided to convince Harry of Wig- 
gins's pro-union position. At this point Sharon has sev- 
eral options. She may try to convince Harry that Wig- 
gins's district is, in fact, industrial. Alternatively, she 
may find some other support for the claim that Wig- 
gins is pro-union or she may find some other means to 
support the anti-NAFTA claim altogether. 

Implicit in Sharon's initial statement was her belief 
that Harry believed that, as a rule, a representative's po- 
sition on labor is determined by the industrial make-up 
of her district. Together with Sharon's claim that Wig- 
gins 's district is industrial, this rule would have provided 
support to convince Harry of Wiggins 's pro- union posi- 
tion. H2 indicates that the support for Sharon's claim 
that Wiggins is pro-union has failed. Sharon must ei- 
ther find an alternate discourse strategy for supporting 
it or must find some other means to support the anti- 
NAFTA claim. Notice the difference between HI and H2. 
A plausible response to HI, but not H2, is to reestablish 
the proposition that Wiggins's district is industrial. 

In H3 as in HI, Harry expresses doubt that Wig- 
gins comes from an industrial district. However, he 
also indicates that he believes that Wiggins is pro-union. 
Sharon's intention to get Harry to believe that Wiggins's 
district is industrial was not achieved. Consequently we 
may infer that her intention to get him to believe that 
Wiggins is pro-union also failed. However, Sharon need 
not try to provide alternate support for either her pro- 
union or anti-NAFTA claims. This is because Sharon's 
intention to convince Harry that Wiggins's district is in- 
dustrial was held in service of the intention to get Harry 
to believe in Wiggins's pro-union position. That is, there 
was a causal connection between the industrial-district 
intention and the pro-union intention; because Harry's 
response explicitly indicates that the pro-union inten- 
tion was achieved, the outcome of those intentions which 
served as preconditions to it or as effects in subplans of 
it can be ignored.^ 

Responses H4 and H5 are variations of HI through H3. 
Their analysis is left to the reader. 



^If, on the other hand, it is important to Sharon (for some 
other reason) that Harry also believe that Wiggins comes 
from an industrial district, then she may need to reconvince 
him of this. 



As can be seen in these examples, a wide range of re- 
sponses to Harry's replies are possible. Each of Harry's 
replies provides feedback about the outcome of a small 
subset of Sharon's intentions. In order to respond ap- 
propriately, Sharon must be able to determine what im- 
plications this feedback has on the ultimate success of 
her other intentions. 

A Discourse Plan for Our Example 

We now describe how the DPOCL system represents 
Sharon's utterance under the analysis given above; see 
Figure ||. 

The manner in which a hearer combines the informa- 
tion in an utterance with his prior beliefs is critical to 
the generation of the utterance. Most previous work has 
made use of highly simple models of this process: for 
instance, it has assumed that the effect of asserting a 
proposition p is either that the hearer believes or does 
not believe p. In fact, a speaker may go to great lengths 
to convince the hearer of the truth of a proposition. She 
may first assert it, then support it, and then provide 
support for the intermediate statement. In such a case, 
the speaker presumably believes that the combination of 
utterances is what leads the hearer to accept the main 
proposition. A complete model of this phenomenon is 
beyond the scope of this paper; we hint at it by repre- 
senting the combination of multiple partial beliefs with 
the action Combine-Belief (x) , where a; is a vector of rel- 
evant beliefs. The strength of belief L that a hearer has 
in a particular proposition P is represented informally 
by the formula Bel(P, L). 

In Figure H] we abbreviate propositions as follows: N 
represents Wiggins will vote No on NAFTA, U represents 
Wiggins is pro-union and / represents Wiggins comes 
from an industrial district. Those conditions surrounded 
by boxes are true in the initial state - causal link arcs 
connecting them to the initial state are omitted for clar- 
ity. 

The DPOCL data structure for representing plans con- 
sists of five components: 

• Steps: Each discourse action in the plan is repre- 
sented by a step. These steps are the nodes in the 
plan graph. Steps are instantiated from action oper- 
ators representing the action's preconditions and ef- 
fects. Steps may be composite, representing abstract 
actions like Cause-to-Believe(C/), or primitive, repre- 
senting those actions that are directly executable by 
the system, such as Inform(/). 

• Decomposition Links: Decomposition links con- 
nect a parent step to the initial and final steps of the 
subplan that achieves the parent step's effects. The 
decomposition links are shown using dashed arcs; they 
capture the hierarchical structure of the plan. 
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Figure 3: An Example Diseourse Plan 



• Causal Links: Causal links connect two steps when 
the first step establishes a precondition for the second 
step. They are shown using solid arcs and are labeled 
with the effects that they contribute. 

• Ordering Constraints: The set of ordering con- 
straints defines a partial temporal ordering over the 
steps in the plan. For readability, these constraints 
are not shown in Figure |[ 

• Binding Constraints: The set of binding con- 
straints provide codesignation relationships for vari- 
ables occurring in the steps of the plan. For read- 
ability, all variables in the plan shown in Figure ^ 
have been replaced by object constants specified by 
the plan's binding constraints. 

DPOCL uses the standard technique of encoding the 
initial conditions and the goals of a planning problem as 
the effects of a null initial action and the preconditions 
of a null final action, respectively. Similarly, in an ac- 
tion decomposition, there is a null initial action that has 
as its effects exactly the preconditions of its parent ac- 
tion, and a null final action that has as its preconditions 
the effects of its parent. The DPOCL planner attempts 
to achieve the preconditions of a subplan's final step in 
the same manner as it achieves all other unsatisfied pre- 
conditions. In this way we guarantee that the effects of 
every composite action are achieved by the steps in its 
subplan. Furthermore, the exact relationship between 
the actions in a subplan and the establishment of those 
effects is made explicit. 

By analyzing the causal and decompositional structure 
of the discourse plan, we can determine an appropriate 
response for each of Harry's replies discussed above: 

• HI: In this case, the effect Bcl(/, L9) asserted by 
the Inform(/) was not achieved. From our represen- 
tation, it is possible to trace a path of failed effects 
from Inform(/) across causal links and up decompo- 
sitional finks to Cause- To-Believe(/), Support(C/) and 
eventually to Cause- To-Believe(iV). Using this infor- 
mation, the system can determine that appropriate re- 
sponses to HI can be generated by trying to convince 
Harry that Wiggins's district is, in fact, industrial (i.e, 
replanning the subtree rooted at the node Cause- To- 
Believe(/), most likely by providing support for /), 
finding some other support for the claim that Wig- 
gins is pro-union (i.e, replanning the subtree rooted at 
the node Support({/)), or finding some other means to 

support thp a.nti-NAFTA p1ai 



an appropriate response results from the re-planning 
of subtrees whose execution is affected by this fail- 
ure. Specifically, those subtrees rooted at Cause- To- 
Believe(causes(/, U)) across causal links and up de- 
compositional links to Support (f/) and eventually to 
Cause- To-Believe(A^). Note that this does not include 
the subtree rooted at Cause-to-Believe(/) and thus, 
unlike in HI, the system will not attempt to reestablish 
the proposition that Wiggins's district is industrial. 

• H3: In this case, the speaker is given more informa- 
tion about the success of the original plan. As in HI, 
the effect Bel(/, L9) is not achieved. However, here 
Harry also indicates that the effect Bel([/, L4) of the 
step Cause- To-Believe(C/) has been achieved. Cause- 
To- Believe (t/) lies along the only causal path from 
Inforni(/) to the plan's final step. Since it achieved 
its intended effect, re-planning any of its subplans is 
unnecessary. 

Although this example did not explicitly illustrate how 
our representation addresses cases where action descrip- 
tions have multiple effects, it is clear our model can han- 
dle such cases appropriately. Our solution rests on the 
fact that our model makes a clear distinction between ef- 
fects of discourse actions that play a role in achieving the 
top-level goals of the discourse plan and effects that are 
not important for achieving the agent's ultimate goals 
(i.e., side effects). 

How DPOCL Creates Discourse Plans 

So far we have focused on the representation used by 
DPOCL. We now briefly describe how the DPOCL algo- 
rithm works. In DPOCL, the process of creating a com- 
pleted plan involves iterating through a loop that chooses 
between refining the current plan decompositionally (ex- 
panding a composite action by adding its subactions to 
the plan) or refining the plan causally (choosing some ac- 
tion's unsatisfied precondition and adding a new action 
and the causal link establishing it) . Figure ^ summarizes 
the DPOCL planning algorithm. For more details of the 
algorithm and a discussion of its formal properties, see 



(Young, Pollack and Moore, 1994). 

The representation of each action is separated into 
two parts corresponding to the causal and decomposi- 
tional roles the action plays: the action operator, and 
a possibly empty set of decomposition operators. The 
action operator captures the action's preconditions and 
effects. These preconditions and effects are sets of first- 
order quantified sentences similar to the typical precon- 
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aim altngpthpr (i p rppla.n- dition and add/delete lists of STRIPS ( Fikes and Nils- 



1971) 



• H2: The DPOCL plan in Figure g is predicated on the 
truth of Bel(causes(/, J7), L8), i.e., that this proposi- 
tion is an effect of the initial step. In H2, Harry reveals 
that this proposition is false. As in the previous case. 



Each decomposition operator represents a 
single-layer expansion of a composite step, essentially 
providing a partial specification for the subplan that 
achieves the parent step's effects given its preconditions. 
In addition to specifying the steps in the subplan, the de- 
composition operator specifies any variable binding and 



temporal ordering constraints between the steps, and the 
causal links between steps of the subplan that enable 
them to establish the parent step's effects. 

The formal specification of the DPOCL algorithm 
relies on nondeterministic choice to guide its search 
through the space of partial plans. Each choice is 
recorded, and backtracking occurs when appropriate. 
Nondeterministic choice is specified in order to allow 
DPOCL implementations to specify domain-dependent 
search control. As long as search control heuristics guar- 
antee that all possible choices will be explored, the im- 
plementation remains complete. 

As a result of adding steps to a plan, newly created 
steps may introduce threats to existing causal links. A 
step A threatens a causal link between two steps B and 
C when A might occur between B and C and one of 
A's effects might undo the condition established in the 
causal link. To ensure that no causal links are undone, 
each threat is dealt with before planning proceeds, ei- 
ther by ordering the steps so that the threatening step 
cannot occur between the two causally-linked steps or by 
restricting the variable bindings of the steps to eliminate 
harmful interactions. This process is iterative, since each 
modification to resolve a threat may introduce new ones. 

In the example discussed earlier, the DPOCL planner 
is invoked with the partial plan consisting of the null ini- 
tial action S-1, whose effects are Bel(causes([/, iV), L7) 
and Bel(causes(/, [/), L8), and the null final action S- 
16, whose only precondition is Bc\{N,L3). By iterating 
through the DPOCL loop shown in Figure IJ, the plan 
shown in Figure |^ is constructed. This plan is completed, 
that is, the preconditions of all actions have been estab- 
lished by causal links, there are no threats to any of these 
links, and all composite actions have been decomposed 
into subplans terminating in executable actions at the 
leaf nodes. 

This plan makes explicit the causal connections be- 
tween each effect and the precondition that relies upon 
it. Similarly, the decomposition links make explicit the 
manner in which actions in a subplan establish the effects 
of the parent step. This representation makes it possible 
for a system playing the role of Sharon to respond appro- 
priately to each of Harry's replies as described earlier. 

Conclusions 

In this paper, we have presented a structure for dis- 
course plans that draws on state-of-the-art AI planning 
research. Both the plan representation and the discourse 
planning algorithm that we use to construct it have a 
well-defined semantics whose formal properties can be 
analyzed ( Young, Pollack and Moore, 1994 ). Further, we 
have shown how DPOCL discourse plan structures can 
be used for determining appropriate responses to utter- 
ances that indicate a failure of some part of the discourse 
plan. 
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Termination: If the plan is inconsistent, then backtrack. 
Otherwise, remove unused step and return tlie plan. 

Plan Refinement: Non-deterministically do one of the fol- 
lowing: 

1. Causal Planning: 

(a) Goal Selection: Nondeterministically select a goal. 

(b) Operator Selection: Add a step to the plan that adds 
an effect that can be unified with the goal (either by 
instantiating the step from the operator library or by 
finding a step already in the plan). If no such step ex- 
ists, backtrack. Otherwise, add the binding constraints 
required for the conditions to unify, an ordering con- 
straint that orders the new step before the goal step 
and add the causal link between the two. 

2. Decompositional Planning: 

(a) Action Selection: Nondeterministically select some 
unexpanded composite step in the plan. 

(b) Decomposition Selection: Nondeterministically 
chose an appropriate decomposition schema for this ac- 
tion whose constraints are satisfied. Add the steps and 
subplan components of the decomposition schema to the 
plan and update the list of decomposition links to indi- 
cate the new subplan. 

Threat Resolution: Find any step that might threaten to 
undo any causal link. For every such step, nondeterministi- 
cally do one of the following: 

• Promotion If possible, move the threatened steps to occur 
before the threat in the plan. 

• Demotion If possible, move the threatened steps to occur 
after the threat in the plan. 

• Separation If possible, add binding constraints on the 

steps involved so that no conflict can arise. 

Recursive Invocation Call the planner recursively with the 
new plan structure. 



Figure 4: The DPOCL Algorithm 



